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CARCINOGEN ASSAY 



PCT/US99/23579 



TECHNICAL FIELD OF THE INVENTION 
The present invention is drawn to the field of biotechnology and more 
5 specifically to biomedical applications. Further, this invention relates to novel assays 
for testing whether a compoimd is a carcinogen. Differential patterns of gene 
expression for test chemical treated and untreated manunalian cells are compared to 
those patterns established for known carcinogens. 

10 BACKGROUND OF THE INVENTION 

The current "gold standard" for testing whether a compound is carcinogenic is 
performed with both sexes of two animal species that are treated chronically for two 
years with a dose of compound at or near the maximum tolerated dose. The end point 
of the assay is tumor formation. These assays are both expensive and laborious. The 

1 5 aim of this invention is to greatly shorten the time required to determine whether a 
chemical compound is a carcinogen. 

Carcinogens act upon a cell in several ways, with the common result by 
definition being induction of neoplasia or tumor formation. Carcinogens may be 
chemical or biological agents, Chemical carcinogens include, for example, 

20 compounds of: DMBA; benzo(a)pyrene; dibenzanthracene; acetoxyaminofluoren, 

methyl N. nitrosogreoxidine; nitroquinoline; dibromomethane; dibromoethane; fiiran; 
and 12-0-tetradecanoyil-13-phorbol-acetate (TPA). Biological carcinogens include 
oncogenic viruses, for example, such as members of the family of papilloma virus. 
Chemical compounds and biological agents have the shared property that they arc 

25 each able to cause mutations of the genetic material of an animal cell or otherwise 
alter the level of expression of non-mutated genes. Despite the different mechanisms 
of action of carcinogens, these compounds each produce changes in gene expression 
that occur in the transition from normal to tumor cell. Changes in the transcription 
level may be detected by differential displays of mRNA isolated from a ceil treated 

30 with a carcinogen. The changes in the level of translation of the mRNA may also be 
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detected in the differential display of proteins or peptides synthesized in the treated 
cell. 

Differential display of mRNA has been articulated as a means of evaluating 
the gene expression profile of a cell for the identification of cancer related genes and 
5 elucidation of their role in tumorigenesis. For example, more than 100 candidate 
tumor suppressor genes have been described. (Sager, R., Proc. Natl. Acad. Sci. USA 
(1997) 94:952-955). Some of these have been identified using differential display of 
cDNA prepared fi-om isolated mRNA. While more than 1 00 oncogenes have been 
identified in animal cells, only a small subset have been found consistently as mutated 

10 genes in human cancer (Copper G. M. (1995) Oncogenes; Jones & Bartlett, 

Seedbury); Weinberg, R.A. Sci. Am. (1996) 275:62-70; and Bishop. J.M. Cell (1991) 
64:235-248). The progression of neoplasia development and transformation into 
malignant tumors is not well understood. However, since cancer genes are defined by 
their phenotypic expression, research into their differential phenotypic expression has 

15 been undertaken as a means to fiirther elucidate the process of carcinogenic induction 
of tumorigenesis. 

For example, carcinogen-mediated changes in cell metabolism may 
differentially alter the expression of oncogenes, tumor suppressor genes, cell cycle 
regulators, DNA replication genes (e.g., encoding repair enzymes, receptors), growth 

2 0 factors, stress proteins or known tuinor markers. Such differential gene expression 

may be represented by an increase or decrease of levels of expression or may result in 
modifications to the gene product expressed, that is, altered forms of proteins, which 
function differently than proteins in normal cells (e.g., greater or lesser binding 
affinity; requirement or lack of requirement of co-factors). 
25 Many studies have addressed the changes in genes (mutations) and the 

changes in gene expression in timiors. These alterations in sequence and synthesis 
have also been examined as markers of tumorigenesis. However, no one has 
examined the early changes in gene expression in the first days or weeks after the 
beginning of treatment with a test compound to determine whether a subset of the 

3 0 changes that occur early in carcinogen treatment are useful to predict whether a tested 

compound is a carcinogen. The pattern that is developed need not be similar to that in 
the tumor but need only be predictive of tumorigenic potential. 



wo 00/22168 PCT/US99/23S79 
Seven cDNA fragments of gene transcripts that were overexpressed in 
mammary carcinomas in the rat model induced by 1 -methyl- 1 -nitrosourea have been 
identified by differential display of mRNA and molecular cloning (Lu, J., et al, 
Molecular Carcinogenesis (1997) 20:204-215). However, this research involved the 
5 comparison of the latter stage of tumori genesis (2 months following chemical 

carcinogenic induction) with control mammary tissue. Lu et al. did not disclose any 
methods for the differential display of gene transcript fragments associated with the 
earlier stages of tumorigenesis, that is, prior to actual tumor formation. 

Similarly, there are numerous reports in the literature of the identification and 
1 0 role of various oncogenes and tiunor suppressor genes. For example, the use of 
differential display to study gene expression during development of squamous cell 
carcinomas and the important role played by c-fos (or v-fos) in the transformation to 
the malignant phenotype in keratinocytes has been reported (Rutberg, S., et al,. 
Molecular Carcinogenesis (1991) 20.SS-9S). Rutberg eM/., asdidLu e/ a/., used 
1 5 the latter stage endpoint of tumor formation in the comparison of differential gen': 
expression in establishing the role of oncogene transformation of epithelial tissue. 

There is a clear need for assays which predict at an early stage whether an 
unknown compound is a carcinogen. Current assays are designed to determine 
whether an agent is carcinogenic by using one or more surrogate markers. For 
2 0 example, the widely used Ames test represents an analysis of the effects of a test 

compound upon the frequency of reverse mutation of bacterial test strains, such as the 
histidine auxotrophic strains of Salmonella typhimurium (Manual of Methods for 
General Bacteriology, Gerhardt et al, American Society for Microbiology Press, 
Washington, D.C. (1981) at page 237). This test is not absolute, since a compound 
2 5 under this assay may be mutagenic and not be carcinogenic or alternatively a 
compound may not evidence mutagenicity in this assay but be carcinogenic as 
determined using other tests. 

Current short-term assays are based mostly on correlation between mutagenic 
• and carcinogenic potential of compound that was established in earlier studies. This 
30 correlation is exploited directly in Salmonella mutation assay (Ames test), in the 
mouse tk*'' assay, and CHO hprf assay. There are also assays based on abilities of 
carcinogens to interact with DNA or chromatin directly, like DNA adducts assay and 
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single cell gel assay for DNA strand breaks. The mechanisms underlying other assays 
are less clearly defined, like for micronucleus assay, and cytogenetic assays (FISH, 
aneuploidy or sister chromatid exchange assays) but are thought to reflect at least in 
part, the disturbance of the DNA or chromosome organization. These assays show 
5 good sensitivity for mutagenic carcinogens but as a rule cannot detect effectively 
activity of the nongenotoxic carcinogens. 

Recently a new test, the Syrian Hamster Embryo (SHE) Cell Transformation 
assay was developed and refined, which is based on the ability of carcinogens to 
induce immortalization and transformation (morphological and malignant) of the SHE 

10 cells in culture (Isfort and LeBoeuf, Mutation Research (1996) 365:161-173). This 
assay shows good sensitivity and accuracy for mutagenic carcinogens as well as 
nongenotoxic carcinogens. 

Application of the battery of mentioned short-term tests increases the validity 
of prediction of whether compounds may be carcinogens. However, tumorigenic 

1 5 transformation of a cell in an organism is a result of the complex interaction of the 
cells with a carcinogen (possibly metabolically activated) and with other cells of the 
organism. Hence, one should expect that cells may react to a tested compound 
depending on the cell's phenotypic characteristics, that is, for example, tissue-specific 
characteristics. Cells of different tissue-specific characteristics may react to the same 

2 0 compound with different outcomes. Considering possible differences in cell response, 
the euoimous variety of industrial, pharmacological, and therapeutic compounds that 
need to be tested, and fiirther considering the multitude of activation pathways, there 
is a clear need for a short-term carcinogenicity assay that allows for the evaluation of 
a cell's response to a compound in different tissues of an organism. 

2 5 Another more recent approach to assaying the effects of test compounds, at 

least indirectly upon a cell's genome, employs cells transfected with genetic 
constructs comprising selected "stress promoters" that are induced by specific stress 
factors, such as "DNA stress" which includes damage to DNA strands by chemicals or 
UVA radiation (Farr, S., et ai, BP 0 680 5 17 (1997)). In this assay, the transformed 

30 cell is exposed to potentially toxic compounds and the levels of transcription of one or 
more heterologous reporter genes, each of which is operatively linked to its respective 
stress promoter, is assayed for transcription by hybridization of probes specific to the 
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mRNA of each reporter gene. This assay indirectly measures whether a potentially 
toxic compound will induce a stress promoter to permit transcription of its reporter 
gene. Fair et al. discloses the creation of a "stress induction profile" for each 
compound by classifying which stress promoters are induced for each compound 
5 tested. This method indirectly measures toxicity by the use of artificial gene 
constructs of selected "stress promoters" and selected reporter genes as they are 
transfected into specific host cells. Further, this method measiires through 
hybridization whether an mRNA transcript of a reporter gene is expressed by a stress 
promoter that responds to toxic agents. Since the stress promoter is fused out of 

1 0 context to the reporter gene, this method does not measure the functionality of the 

transcript or translated protein product. Since Farr et al. transformed HepG2 cells in 
their assay, these artificial gene constructs may provide different outcomes if the 
constructs are used to transfonn different host cells or host cells of different tissues. 
Still wanting is a method that measures differential expression of native genes on a 

1 5 wider scale and in the short period following exposure of the target cell to the test 
compound. 

The high-throughput methods (based mostly on using high-density gene 
arrays) developed in recent years, will be exploited to establish patterns of gene 
expression changes that are diagnostic for response to known carcinogens of the 
2 0 different mouse tissue cells. These patterns will serve as a basis for a new short-term 
assay for detecting carcinogenic potential of chemical and biological compounds. 

A high-capacity quantitative method of differential gene expression that uses 
hybridizable cDNA arrays robotically printed on glass was reported by Schena, M., et 
al. Science (1995) 270:467-470. This method fixes to a substrate amplified cDNA 

2 5 prepared fi'om both complete gene sequences and firom partial sequences or expressed 

sequence tags (ESTs). Fluorescent hybridization probes were prepared ftom totai 
mRNA and used to identify corresponding cDNA. This methodology was able to 
clearly identify the overexpression of a single target gene. 

In contrast, currently available assays for carcinogenesis either operate in 

3 0 bacterial test systems, which do not accurately reflect mammalian cellular response, or 

require a significant amount of time to demonstrate phenotypic expression of 
carcinogen activity in mammalian whole animal models. The present invention is 
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able to quickly identify carcinogen activity by effects upon differential gene 
expression in mammalian cells or animals at a point in time prior to tumor formation 
by comparing these differential levels of expression with prior detemiined patterns of 
differential genetic expression caused by known carcinogens. 

5 

SUMMARY OF THE INVENTION 
The present invention provides an assay for the early detection of 
carcinogenesis by measuring the net changes in gene expression that result from the 
treatment of a mammalian cell or whole animal with a test agent over those of a 

1 0 control cell or control whole animal further compared with previously determined 
reference patterns of genetic expression altered by known carcinogens. The present 
invention further provides methods and kits to identify a carcinogen. The present 
invention also provides for methods for identifying DNA and proteins related to 
carcinogenesis. Additionally, the present invention contemplates the DNA and 

1 5 proteins related to carcinogenesis that are identified by the methods and assays of the 
present invention and that are predictive that a test agent or substance is a carcinogen. 

More specifically, one embodiment of this present invention provides assays 
and methods comprising the steps of: a) preparing cDNA from RNA isolated firom 
mammalian cells treated with a test agent for less than about two years, wherein said 

2 0 cCNA is detectably labeled; b) deteimining a pattern of expression by assessing the 
ability of said labeled cDNA to hybridize to each of a multiplicity of nucleotide 
sequences arrayed on a substrate; c) detennining reference patterns of differential 
gene expression by assessing the ability of a second labeled cDNA to hybridize to 
each of a multiplicity of nucleotide sequences, wherein said second labeled cDNA is 

2 5 prepared ftom RNA isolated from mammalian cells treated with a known carcinogen 

for less than about two years; and d) comparing similarities in patterns of expression 
from cells treated with a test agent to reference patterns from cells treated with a 
known carcinogen. 

A preferred embodiment provides that RNA is isolated from mammalian cells 

3 0 treated with a test agent or known carcinogen for less that about six months. A further 

preferred embodiment provides that RNA is isolated from mammalian cells treated 
less than about two weeks or more preferably treated for less than about two days. • 
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In another preferred embodiment, the test cDNA may be labeled with a 
fluorescent molecule emitting one wavelength of light and the control cDNA labeled 
with a second fluorescent molecule emitting a second wavelength of light. The 
hybridization results that form a pattern of expression and clearly represent effects of 
5 treatment of mammalian cells with a test agent or substance will always thus be 

controlled internally. Other preferred embodiments include the use of other similarly 
detectable labels, that include but are not limited to: radioisotopes; enzymes; antigens 
or antibodies. 

Another embodiment of the present invention is a method to identify a test 

10 substance as a carcinogen, which method includes: a) providing cDNA derived from 
manunalian cells that have been contacted with the test substance for less than about 
two years; b) determining a first pattern of hybridization of the cDNA with respect to 
a multiplicity of nucleotide sequences; and c) comparing the pattern with a 
corresponding second pattern obtained at a corresponding time after the cells have 

1 5 been contacted with a known carcinogen, whereby a similarity in the first and second 
patterns identifies the test substance as a carcinogen. A preferred embodiment 
provides that the cDNA is obtained firom cells that have been contacted with the lest 
substance or a known carcinogen for less that about six months. A further preferred 
embodiment provides that the cDNA is obtained from cells contacted for less than 

2 0 about two weeks or more preferably contacted for less than about two days. 

Another aspect of the present invention provides an assay kit for identifying a 
test substance as a carcinogen, compiising: a) reagents for the preparation of cDNA 
prepared from RNA isolated from mammalian cells treated with a test substance for 
less than about two years; b) reagents for the preparation of cDNA prepared from 

2 5 RNA isolated from mammalian cells treated with a known carcinogen for less than 
about two years; c) DNA sequences arrayed on a substrate that are hybridizable to the 
cDNA; and d) label suitable for the detection of the DNA sequences hybridized to the 
cDNA; wherein the identification of a test substance as a carcinogen is achieved by 
creating a first pattern of differential gene expression using cDNA prepared from the 

30 RNA isolated from mammalian cells treated with a test substance for less than about 
two years and comparing the first pattern with a corresponding second pattern 
obtained at a corresponding time after the cells have been contacted with a known 
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carcinogen, whereby a similarity in the first and second patterns identifies the test 
substance as a carcinogen. 

Yet another aspect of this invention provides an assay for early detection of 
carcinogenesis, comprising the steps of: a) selecting a protein isolated firom 
5 mammalian cells treated with a test agent for less than about two years; b) measuring 
differences in protein synthesis or post-translational modification; and c) determining 
whether the test agent is carcinogenic by comparing differences in protein synthesis or 
post-translational modification to protein isolated fi-om mammalian cells treated with 
a known carcinogen for less than about two years. In a preferred embodiment, this 

10 invention provides that the differences in protein synthesis or post-translational 
modification are measured by electrophorectic gel analysis and that the protein is 
detectably labeled. Such detectable labels, include but are not limited to: a 
fluorescent dye; radiolabelled compound; or enzymatically detectable label, for 
example, horse radish peroxidase or alkaline phosphotase. 

1 5 One other embodiment of the present invention provides that differences in 

protein synthesis or post-translational modification are measured by enzymatic 
digestion of protein and mass spectrometry analysis. 

In another embodiment, this invention contemplates that the mammalian cells 
are treated with a carcinogen in vitro. In still another embodiment, this invention 

20 contemplates that the mammalian cells are treated with a carcinogen in VIVO. Still 
another preferred embodiment, this invention provides that the differential gene 
expression is selected Srom the group consisting of genes whose level of expression is 
increased; genes whose level of expression is decreased; genes that are expressed and 
genes that are not expressed. 

2 5 Another aspect of the present invention provides for methods of identifying 

genes, genetic sequences and proteins related to carcinogenesis as well as the genes, 
genetic sequences and proteins identified by the methods of this invention. A fiiicher 
aspect of the present invention provides for the reference patterns of differential gene 
expression of mammalian cells treated with a known carcinogen. Another 

3 0 embodiment of the present invention provides for protein measure by any one of the 

methods disclosed. A still fiirther aspect of the present invention includes DNA or a 
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DNA sequence as used in any of the methods provided herein, where the DNA or 
DNA sequence is predictive that a test agent or test substance is a carcinogen. 

Additional aspects of the present invention relate to the use of substrates as 
contemplated herein. A preferred embodiment of the present invention provides for a 
5 multiplicity of nucleotide sequences arrayed on a substrate. A still further preferred 
embodiment contemplates substrates selected from the group consisting of: membrane 
filter; glass; ceramic; and solid organic polymer. 

The present invention further contemplates the use of one or more assays or 
methods as discussed supra and further disclosed herein used in combination with one 

10 another. For example, a preferred embodiment includes the steps of: preparing a first 
cDNA from a first mRNA; isolated from mammalian cells treated with a test agent for 
less than about two years, wherein the first cDNA is detectably labeled; determining a 
pattern or expression by assessing the ability of the labeled cDNA to hybridize to each 
of a multiplicity of nucleotide sequences arrayed on a substrate; detennining reference 

1 5 patterns of differential gene expression by assessing the ability of a second labeled 
cDNA to hybridize to each of a multiplicity of nucleotide sequences, wherein the 
second labeled cDNA is prepared from a RNA isolated from mammalian cells treated 
with a known carcinogen for less than about two years and comparing similarities in 
patterns of expression from cells treated with a test agent to reference patterns from 

2 0 cells treated with a known carcinogen. This assay further comprises the steps of: 

selecting a protein isolated from the same mammalian cells treated with the same test 
agent for less than about two years; measuring differences iii protein synthesis or post- 
translational modification; and detennining whether the test agent is carcinogenic by 
comparing differences in protein synthesis or post-translational modification to 

2 5 protein isolated from mammalian cells treated with a known carcinogen for less than 

about two years. In a preferred embodiment, this invention provides that the 
differences in protein synthesis or post-translational modification are measured by 
electrophorectic gel analysis and that the protein is detectably labeled. 

Citation of the above documents is not intended as an admission that any of 

3 0 the foregoing is pertinent prior art. All statements as to the date or representation as 

to the contents of these documents is based on the information available to the 
• applicants and does not constitute any admission as to the correctness of the dates or 
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contents of these documents. Further, all documents referred to throughout this 
application are incorporated in their entirety by reference herein. 



BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows changes in gene expression in TPA-Treated and calcium 
differentiated mouse keratinocytes. 

Figure 2 shows P & T Primers used in polymerase chain reaction. 

DETAILED DESCRIPTION OF THE INVENTION 

The above and various other objects and advantages of the present invention 
are achieved by preparing a differential display of genetic expression for cells treated 
with a test agent compared with reference patterns of differential gene expression of 
mammalian cells treated with one or more carcinogens. 

Unless defined otherwise, all technical and scientific terms used herein have 
the same meaning as commonly understood by one of ordinary skill in the art to 
which this invention belongs. Although any methods and materials similar or 
equivalent to those described can be used in the practice or testing of the present 
invention, the preferred methods and materials are now described. All publications 
and patent documents referenced in this application are incorporated in their entirety 
herein by reference. 

For example, what is meant by carcinogenesis is the origin or initiation or 
production of neoplasia or tumor formation. Neoplasia is meant to include both 
benign and malignant growth of new tissue. Such neoplasia may include, but is not 
limited to: carcinoma, sarcoma, lymphoma, blastoma, and mesothelioma. A test agent 
is any molecule, compound or biological organism that is selected to be used with the 
methods, assays or other embodiments of the present invention. The test agent may 
be modified or altered to facilitate carcinogenicity. Further, the use of the term 
carcinogen is intended to include carcinogenic agent. Mammalian cells include both 
cultured cell lines as well as cells isolated from mammalian tissues. 

Conditions for the hybridization of DNA are readily available to one in the art. 
Such conditions include those as provided in Denhardt et al.JMol Biol 12(3):641-646 
(1965) or in Dahlberg, et ai, (U.S. Patent 5,691,142), which is incorporated in its 
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entirety herein by reference. The detection of specific nucleic acid sequences or a 
muhiplicity of nucleotide sequences as present in oligonucleotides has been achieved 
typically by hybridization. Hybridization methods involve the annealing of a 
complementary sequence to the target nucleic acid (the sequence to be detected). The 
5 ability of two polymers of nucleic acid containing complementary sequences to find 
each other and anneal through base pairing interaction is a well-recognized 
phenomenon. The initial observations of the "hybridization" process by Marmur and 
Lane, Proc. Natl. Acad. Sci. USA 46:453 (1960) and Doty et al., Proc. Natl. Acad. Set. 
USA 46:461 (1960) have been followed by the refinement of this process into an 

10 essential tool of modem biology. Initial hybridization studies, such as those 

performed by Hayashi et al, Proc. Natl. Acad. Sci. USA 50:664 (1963), were formed 
in solution. Further development led to the immobilization of the target DNA or RNA 
on solid supports. With the discovery of specific restriction endonucleases by Smith 
and Wilcox, 7. Mol. Biol. 5 1 ;379 (1970), it became possible to isolate discrete 

15 fragments of DNA. Utilization of immobilization techniques, such as those described 
by Southern, J. Mol. Biol. 98:503 (1975), in combination with restriction enzymes, 
has allowed for the identification by hybridization of single copy genes among a mass 
of fi:xictionated, genomic DNA. 

What is meant by differential gene expression is, for example, but is not 

2 0 limited to, a method to evaluate the expression of the genes of a cell. Such differential 

gene expression may be represented as an increase or decrease in the levels (including 
non-detectable levels) of transcription of genes by the synthesis of mRNA. 
Differential gene expression may also be represented as an increase or decrease in the 
levels (including non-detectable levels) of translation of mRNA coding for a gene in 
25 the synthesis of a protein or peptide. Also, differential gene expression may include 
post-translational modification of the protein or peptide sequence as well as mutated 
forms of the protein or peptide gene product. 

What is meant by reference patterns of differential gene expression is the 
physical manifestations of differential gene expression. Such patterns may take the 

3 0 form of films or gels or data records or other tangible representations of 

hybridizations of genetic fragments or levels of protein or peptide synthesis that are 
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created by determining the changes in genetic expression of mammalian cells that 
have been treated with known carcinogens. 

Wh&t is meant by referring to mammalian cells as pre-tumorous cells is that 
the cells have not yet demonstrated tumor formation or unregulated growth. 

5 

I. Differential Genetic Expression Array 

The present invention contemplates that the reference pattems of differential 
gene expression of this assay could include as few as several sequences and up to 
thousands. The selection of genes may be made by simply choosing known 

1 0 oncogenes, tumor suppressor genes, cell cycle regulation genes, and DNA replication 
genes (encoding repair enzymes, receptors) growth factor genes, stress proteins or 
known tumor marker genes. While one may determine to screen all known EST's in 
databases known to the art, it is more likely that sufficient sequences, to defme a 
pattern or pattems of gene synthesis change, will be found by only screening a 

1 5 fraction of the known sequences. It is not necessary to detect every change but rather 
to detect a number of changes that are sufficient to establish a reference pattern that is 
useful in distinguishing a carcinogen from other non-carcinogenic compounds. The 
number of sequences that are contemplated for selection may range between three tc 
1000 and preferably from 10 to 200 and more preferably from 20-100. 

2 0 The present assay will detect changes in gene expression early in a treatment 

protocol. These changes may occur in subsequent tumors caused by the carcinogen 
but this is not a necessary finding. The important changes may be transitory and our 
exp^ments are designed to detect them. We anticipate that the important changes 
occur in treated animals from as few as a single treatment to five doses per week for 
25 up to 90 days. We Wish to shorten the time that animals are treated as much as 
possible. 

Initially test cDNA is prepared from RNA obtained from tissue of animals 
treated with a test agent, where in order to create reference pattems, test agents will be 
carcinogens. The reference will be cDNA prepared from RNA of control animals of 

3 0 the same age, sex, and protocol that have not been treated with a test agent but with 

the vehicle alone. An ideal version of the assay could be the selection of a set of cell 
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lines that show a pattern of gene synthesis, which is a diagnostic of a carcinogen. 
This would eliminate the use of animals completely for our assays. 

II. Use of cDNA hybridization patterns 
5 Once the set of gene sequences to be used in our tests has been selected we 

contemplate that assays may be performed using one of several formats. (1) Gene 
sequences to be tested could be immobilized on a substrate, such as silicon, glass or 
other suitable surface or sequences may be synthesized in place on the surface. Many 
such substrates are commonly referred to as chips or gene chips. The first cDNA 

10 probe, labeled with one detectable label such as a fluorescent molecule, is prepared 
firom RNA extracted fi-om a tissue from a treated animal. The second cDNA probe 
prepared from KNA extracted from the same tissue from a matched control animal 
and labeled with a second detectable label, such as a second fluorescent molecule. 
These probes are hybridized to the cDNA or oligonucleotides on the chip. After 

1 5 hybridization, the relative amount of the detectable labels, for example the 

fluorescence, is measured over each spot of cDNA and this measurement is corrected 
for housekeeping cDNA sequences on the same chip. Housekeeping cDNA 
sequences are used as a control to normalize data. The differences in synthesis, either 
increased or decreased relative to the control, define the pattern of synthesis or gene 

2 0 expression. The observed pattern is compared to our database to determine whether it 
resembles a pattern associated with known carcinogens. Housekeeping genes are 
sequences such as p-actin cDNA because p-actin is assumed to be sjmthesized in both 
treated and untreated animals. We are not limited to P-actin as a control and can use 
any sequence, which is synthesized at the same levels in both tissues. Hybridization 

2 5 results are internally controlled by labeling the test cDNA using one detectable label 
that is distinguishable over the second detectable label, for example, by labeling the 
test cDNA with a fluorescent molecule emitting one color of light and the control 
cDNA labeled with a second fluorescent molecule emitting a second distinguishable 
color. 

30 (2) A second method would be to fix an array of cDNA on a glass slide or 

other ceramic substrate. The labeling, hybridization, and analysis are performed 
similarly to that described supra in order to determine the pattern of synthesis for a 
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sample. An array of DNA sequences could easily be prepared using techniques 
available to one of skill in the art. For example, U.S. Patent 5,445,934 teaches a 
method and apparatus for preparation of a substrate containing a plurality of 
sequences. Photoremovable groups are attached to a surface of a substrate. Selected 
5 regions of the substrate are exposed to light so as to activate the selected areas. A 
monomer, also containing a photoremovable group, is provided to the substrate to 
bind at the selected areas. The process is repeated using a variety of monomers such 
as amino acids until sequences of a desired length are obtained. Detection methods 
and apparatus are also disclosed. Also, U.S. Patent 5,700,637 teaches an apparatus 

1 0 and method for analyzing a polynucleotide sequence; either an unknown sequence or 
a known sequence. A support, e.g. a glass plate, carries an array of the whole or a 
chosen part of a complete set of oligonucleotides which are capable of taking part in 
hybridizd.tion reactions. The array may comprise one or more pair of oligonucleotides 
of chosen lengths. The polynucleotide sequence, or fragments thereof, are labeled and 

1 5 applied to the array under hybridizing conditions. Applications include analyses of 
known point mutations, genomic fingerprinting, linkage analysis, characterization of 
mRNAs, mRNA populations, and sequence determination. 

A multiplicity of nucleotide sequences arrayed on a substrate may also be 
commercially available, for example, from Clontech Laboratories. 

20 (3) A third method would be to denature and array the cDNA sequences for 

our set of test sequences on two sets of membrane filters. Radiolabeled cDNAs are 
prepared from the test RNA and the control and each is hybridized to a set of 
membrane filters. After washing, the filters are either exposed to x-ray film or 
analyzed in a phosphoimaging system to measure the amount of cDNA hybridized to 

2 5 each set of membranes. The hybridization is normalized for housekeeping genes and 

the pattern of synthesis determined by the relative radioactivity over each cDNA. 
This determination is made by comparing a test filter with its control counterpart. A 
further alternative is to employ two separate radioisotopes for labeling the cDNA 
prepared fi-om test RNA and cDNA prepared from the control RNA. 

3 0 The skilled artisan would readily appreciate from the present disclosure, that 

other detectable labels, including enzymes, such as horse radish peroxidase or alkaline 
phosphotase may be used accordingly. 
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III. Use of Protein Synthesis Patterns 

The present invention further contemplates that the changes in translation of 
genetic sequences as well as the changes in transcription of genes, as has been 
5 discussed supra, may be used as a measure or characteristic of whether a test agent 
functions as a carcinogen. Changes in the amounts or levels of expression of the 
genetic end products ~ proteins- as translated from the mRNA of mammalian cells 
may also be determined. However, the present invention should not merely be limited 
to the detection of differences in the level of protein synthesis, either increased or 

10 decreased, in a tissue from treated animal cells since postranslational modification 
could also be affected by carcinogen treatment. These postranslational modifications 
include: cleavage of the protein into smaller units; degradation to short peptides or 
amino acids; phosphorylation; alkylation; deamidation; glycosylation; as well as any 
other postranslational modifications of the synthesized proteins. 

1 5 The methods available to measure difiference in protein synthesis or 

postranslational modifications are well known in the art. For example, two 
dimensional polyacrylamide gel electrophoresis separates proteins or peptides by net 
charge in the first direction and by size in the second. Proteins or peptides are 
visualized by staining or autoradiography and analyzed using image analysis software. 

20 Proteins can be displayed by two-dimensional gel electrophoresis. A sample 

of protein to be displayed is loaded onto an isoelectric focusing gel and the proteins 
allowed to reach equilibrium. A lane of this gel, narrow strip, or tube gel is laid on 
top of an SDS Page slab gel and the proteins previously separated by isoelectric point 
now are separated by size. The proteins thus displayed can be detected by staining or 

25 by autoradiography if previously radiolabeled. Clearly the proteins displayed from 
any sample will depend on the sample preparation, pH range of the focusing gel, and 
the polyacrylamide concentration of the size separation gel. 

The proteins differentially expressed early in test compound treatment can be 
identified by comparing two dimensional gels run on proteins extracts of tissues from 

3 0 treated and untreated animals. A protein that is overexpressed upon treatment will 
show up as a stronger spot on the treated gel and underexpressed proteins will be 
lighter than the corresponding control spot. The protein in question can be further 
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identified by ttypsin (or other specific cleavage treatment) and analysis by mass 
spectrometry. The precise molecular weights of the peptides can be matched to 
peptide databases to make the identification. If one peptide appears in more than one 
protein, additional peptides can be analyzed to complete the identification. 
5 An alternate method of analyzing proteins would be through the use of mass 

spectroscopy to characterize proteins or peptides by their mass. This characterization 
could be coupled to a method of separation of proteins, including capillary 
electrophoresis or liquid chromatography. The separation or filtration step or the two 
dimensional gells above, will limit the number of proteins or peptides characterized at 

10 any one time. Furthermore, the filtering step may be designed to emphasize those 
proteins most changed by the treatment with a carcinogen. Identification would be 
made by searching a molecular weight database. 

Additional filtering steps could be followed in the method of initial extraction 
of the proteins or peptides. Rather than examining the total proteins synthesized in a 

15 tissue, selected fractions could be analyzed. Such fractions may include: the isolation 
of soluble fractions; membrane fractions; salt precipitate fractions; or firactions 
obtained by any other treatment. 

Clearly the characterization of carcinogens by protein changes, either 
alterations in synthesis or modifications, will yield reference patterns of differential 

20 genetic expression or changes that are characteristic of carcinogens. These reference 
patterns may characterize a new carcinogenic agent if the pattern resulted from 
treatment of animal cells with (he new agent. 

Having now generally described the invention, the same will be more readily 
understood through reference to the following examples which are provided by way of 

2 5 illustration, and are not intended to be limiting of the present invention, unless 
specified. 
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EXAMPLE! 

Gene Expression Changes in TPA-Trcated and Terminally Differentiated Primary 

Mouse Keratinocytes Cultures 

1 . Primary mouse keratinocytes cultures 

5 Cultures of newborn mouse keratinocytes were established essentially as 

described by H. Hennings (In Keratinocyte Methods, Cambridge University Press 
(1994), pages 21-23). Briefly, mice two to three days postpartum were sacrificed by 
metaphane narcosis, rinsed in 70% ethanol, and kept on ice. Skins were peeled off, 
stretched out and floated (with epidermis facing upward) on the surface of a 0.25% 

10 trypsin solution in Ca*^, Mg'- free PBS overnight at 4°C. Skins were transferred to a 
sterile surface, the epidermis was separated from dermis, the epidermis was minced, 
and the mixture stirred in serum-free keratinocyte growth medium (KGM, Clonetics). 
This cell suspension was filtered through nylon gauze, cells were centrifuged (2,000 
rpm for 5 min), resuspended in KGM with 20 ^M Ca*^ (20 KGM), and then the cells 

1 5 were counted and plated (approximately 2x1 0' per 1 0 cm plate coated with mouse 
collagen type IV in 20 KGM) and the cell cultures were incubated at 37°C with a 7% 
C0{ After 16-24 hours, cells were washed in Ca*^, Mg*^ free phosphate buffered 
saline (PBS), and cultivated on 20 KGM, with the cell culture medium changed (that 
is, old medium discarded and fresh medium provided) each other day. For induction 

20 of terminal differentiation, medium was changed, usually on the sixth day of 
cultivation of the keratinocytes, to KGM containing 1.2 mM Ca*^, and the 
keratinocytes were cultivated for 24 hours before RNA isolation. 

For phorbol-myristate acetate (TP A) treatment, keratinocyte cultures were fed 
with 20 KGM containing 0.1 fig/ml of 12-0-tetradecanoyil-13-phorbol-acetate for 24 

25 hours, before isolation of RNA. 

2. Differential display of mRNAs synthesized in control keratinocytes, 
Ca"^" differentiated keratinocytes and TPA-treated keratinocytes. 

Total RNA was isolated by acid phenol procedure using TRI Reagent™ 
30 (Molecular Research Center, Inc.) and according to manufacturer's instructions from 
(1) terminally differentiated keratinocytes, cultivaced in KGM medium containing 
1.2mM Ca*^ for 24 hours; (2) TPA-treated keratinocytes, cultivated in KGM 
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containing 20 ^M Ca*' and 0. 1 ng/ml TPA for 24 hours; and (3) control keratinocyte 
cultures fed 24 hours before RNA isolation with 20 KGM containing, in some 
experiments, 0.05% DMSO (dimethyl sulphoxide). 

Differential display of total RNAs from keratinocyte cultures (1), (2), and (3) 
5 was performed with Delta-RNA Fingerprinting Kit™ (Clontech) according to the 
manufacturer's instructions. Briefly, cDNAs were synthesized from total RNAs using 
Murine Moloney Leukemia Virus (MMLV) reverse transcriptase. Two dilutions of 
cDNA were prepared (diluted 10-fold and 40- fold). Each total RNA sample was used 
to synthesize cDNA that was amplified using polymerase chain reaction (PCR) 

1 0 methodology using P & T primers in presence of [a-"P] dATP. These primers ai e 
listed in Figure 2. The labeled amplified bands were displayed on sequencing gels 
and the differential appearance of labeled bands for each primer pair was examined. 
The appearance of a band from a sample that was not present in the control lane 
indicated overexpression and the dis^pearance or reduction in intensity of a band 

15 indicated suppression of synthesis. The candidate bands were cut from the 
sequencing gel and reamplified using the original primer pairs. Following 
electrophoresis on agarose gels, reamplified DNA fragments were cut out and purified 
using a QIAEX II gel extraction kit™ (QIAGEN). The DNAs were labeled and used 
to confirm the differential expression by hybridization with Northern blots of Poly 

2 0 (A)' containing RNA isolated from total RNAs (1 ),(2), and (3). Equal quantities of 
these RNAs were electrophoresed on agarose gels and blotted to nitrocellulose filters. 
Equal loading of RNA was confirmed by hybridization with a P-actin probe, whose 
mRNA was assumed to be present in approximately equal amounts in all of the 
preparations. The blots were stripped of radiolabel and then used to confirm the 

25 differential synthesis of a fragment. Upon confirmation, the fragment was cloned into 
pCRII using the TA cloning kit™ (Introvitrogen). Cloned DNA fragments were 
sequenced with SequiTerm Excel II kit™ (Epicentre Technology). 

The genes or ESTs identified by differential display are presented in Table 1 . 
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Table 1. Difierentiaily Expressed Genes in TPA Treated Keratinocytes, 
Identified By Diflierential Display and Northern Blot 
Hybridization 





StseofClondd 






Gem 


Fragmenf, bp 




110176/4 


200 


Suppressed in TPA- Treated 
Cells 


Mouse Cytokeratin Endo A 
(156/156;100%) 


2672/1 


550 


Suppressed in TPA- Treated 
Cells 


Mouse ESTs AA492919 
(124/127:97%). AA607761 
(93/97;95%); Human Dnase 1 
Homologue HS U62647 

\f\\lOOj( f /o) 


31777 


650 


Overexpressed in TPA- 
Treated Cells 


Mouse EST AA073833 (67/67; 
100%) 


19102/3 


320 


Overexpressed in CA+2- 
Treated ans Suppressed in 
TPA- Treated Cells 


Mouse ESTs /!kA03861 3 
(251/251 ;100%), W14382 
(212/213:99%) 


3177715 


650 


Suppressed in TPA- Treated 

Cells 


Mouse ESTSAA423711 
((156/156:100%), and AA637015 

(96/98:97%) 


83442 


550 


Overexpressed in TPA- 
Treated Cells 


Mouse Connexin 26 (160/161; 
99%) 


31774/6/8 


650 


Suppressed in TPA- Treated 
Cells (6 kb RNA) 


Mouse ESTSAA423711 
(186/186:100%), W82493 
(198/203:97%; and 52/53; 98%. 
AA637001 591 32/1 34:98%) 


4314 


1000 


Overexpressed in TPA - and 
Ca+2- Treated Cells 


AA109991 (200/201 :99%). 
AA561 91 4(200/201 ;99%) 
AA067670 (139/140; 99%), 
AA8641 27 (133/134:99%) 


2552/4 


650 


Suppressed in TPA- Treated 
Cells (5.5 kb RNA) 


ND 



3 . Hybridization of cDNA to filters containing mouse DNA sequences 
Atlas™ Membranes (Clontech) contain mouse cDNA in paired spots. Each 
pair cDNA, included gene sequences from known oncogenes, tumor suppressors, cell 
cycle regulators, stress response genes, ion channels and transporters, intracellular 
signal transducers, apoptosis genes, DNA synthesis, DNA repair, DNA 
recombination, transcription factors, general DNA binding proteins, receptors, cell 
surface antigens, cell adhesion, cell-cell communication, cytoskeleton, mobility, 
protein turnover, and housekeeping proteins. Each membrane filter contained 588- 
mouse cDNA's, as well as nine housekeeping control cDNAs, and negative control 
sequences. We have used pairs of these filters to select additional gene sequences, 
which are differentially expressed in the keratinocytes. 
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Poly(A)*, containing RNAs were prepared from total RNAs isolated from 
control (untreated), TPA-treated and terminally differentiated primary mouse 
keratinocytes as discussed supra. Labeled cDNAs were synthesized with reagents 
provided in the Atlas™ Kit (Clontech) and used for hybridization with Atlas™ 
5 membranes (Clontech) according to manufacturer instructions. An experimental 
cDNA was always compared to control cDNA. The filters were washed and exposed 
to X-ray films for varying lengths of time. Estimation of equal hybridization to 
housekeeping genes was used as normalization so that gene sequences that are over or 
under-expressed could be determined by the difference in intensity of the same spots 
10 on an experimental filter and a control. 

Two and three technicians analyzed sets of radioautograms exposed to pairs of 
Atlas™ membranes (ClonTech) hybridized to experimental and control cDNAs and 
showing equal signals for housekeeping genes. Only genes that were consistently 
identified as overexpressed or suppressed genes comparing control radioautograms, as 
1 5 shown in Figure 1 , are provided in Tables 2 and 3. 
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Table 2 DifTerentially Expressed Genes In TPA Treated Keratinocytes, 

Identified by Hybridization With Atlas™ cDNA Array(s) (Membranes) 

















pPrhA rtnf*Anono' Xh\/rrtiri hnrmrwip rpi^ntnr 


rMNM r uiynieraSQ * isiniinoiiun laciur i i r- 1 


cjun proiu-uncoyene 






wor~ 1 , cojony siiniuiGiuny lauior i 




Cdk4 


p18inc4; cdk4 and cdk6 inhibitor 


Oxidative stress-Induced protein 


Golgi 4 b^nsmembrane spanning transporter 


l-Kappa B alpha chain 


Glucose transporter-1 


Protein l<inase C theta type 


lnterleukin-6 receptorbeta chain 


Transducin beta-2 subunit 


STAT6; {IL-4 Stat) 


Bax protein; homologue of Bcl-2 


Syp; adaptor protein tyrosine phosphatase 


GST Pil; glutathione S-transferase Pi 1 


Gluthathione S-transferase (microsomal) 


Chop 10; nomologue of Gadd153 


Gluthathione S-transferase Mul 


TDAG51; couples TCR signaling to Fas 

expression 


Proteine tyrosine phosphotase 


Activating transcription factor 4 


Butyrate response factor 1 


Homeo box protein 8 


CACCC Box binding protein BKLF 


CD44 antigen 


DP-1 (DRTF- polypeptide 1)ceil cycle regulatory 
transcription factor 


Integrin alpha 2 


HMG-14 


integrin beta 


Dystroglycan 1 


VLA-3 alpha subunit 


Fpllistatin 


Inhibin beta subunit 


Cytokeratin 1 


Cytokeratin 18 


TIMP-3 tissue inhibitor of metailoproteinase 3 


Cathepsin 
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Table 3 DifTerentially Expressed Genes is Terminally Differentiated 

Keratinocytes Identified by Hybridization With Altas ™ cDNA Array 
Membranes 







Ezrin (Viilin 2) 


Tumor suppressor maspin 


^ 1 igni junciion proieinj 


CHTiyc proio-uncoyenc 


Cot proto-oncogene 


Cyclin G 


Casein kinase II (alpha Subunit) 


Prothymosin 3lph3 


N-ras proto-oncogene 


DO 1 -1 , uuoo antigen 


pioinK4; caK4 ana cokd tnniDitor 


GST Pi 1; glutathione S-transfererase Pi 1 
preadipocyte growth factor 


Glutathione reductase 


UAU 1 1 Q6TenQ6r ayainsi c^ii u6ain i 


OAL>v^L> Dox- Dinuiny pr oiein 


Or iO( S6nn6 pruieiiiase inniuiior 


Hnmpnhny nrnt^in R 


Rad23 homnloaue 

IVII irXfcwUf lAOU&w 1 IWI 1 IwlVUUQ 


Retinoic acid binding protein II 
(CRABP-II) 


Ubiquitin-conjugating enzyme 


Insuiin-iike growth factor binding 
protein-1 (IGFBP-1) 


YY1 (UCRBP) transcription factor 


Cytokeratin 18 


CD44 antigen 




Dystroglycan 1 




Neuroleukin 




Cytokeratin 14 




Cathepsin H 




Cathepsin L 



5 4. cDNA subtraction and hybridization with Atlas™ membranes 

(Clontech) 

cDNAs were synthesized from poiy(A)* RNA isolated from untreated control 
and TPA-treated mouse keratinocytes and subtracted using PCR-Select™ cDNA 
subtraction kit (Clontech) following by the manufactures instructions. cDNA from 

10 TPA-treated keratinocj^es that was subtracted against cDNA from untreated cells was 
labeled by random primer labeUng protocol and used in hybridization studies on 
Atlas™ cDNA membranes. These studies identified genes that were overexpressed in 
TPA-treated keratinocytes. Similarly, cDNA from control keratinocytes was 
subtracted against cDNA from TPA-treated keratinocytes. Varying the time of film 

15 exposure, normalized hybridization signals for housekeeping genes were obtained and 
autoradiograms were analyzed by two technicians. A list of genes overexpressed and 
suppressed upon TPA treatment of mouse primary kCTatinocytes is shown in Table 4. 
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Table 4 Differentially Expressed Genes in TPA Treated Keratinocytes Identified 
by Hybridization of the Subtracted cDNAs with Mouse Atlas^" cDNA 
Array Membranes 











BST-1 ; lymphocyte differentiation antigen 
CD38 


c-ErbA oncogene (Thyroid hormone receptor) 


l-Kappa B sipha chain 


H-ras proto-oncogene 


Rac 1 murine homologue 


Cyclin D1 


Zyxin; LIM domain protein;-actin binding 
protein 


Cyclin D2 


Sax protein (Bcl-2 homologue) 


Prothymosin alpha 


GST Pi 1; glutathione S-transferase; 
preadipocyte growth factor 


Hsp27; heat shock 27KD- protein 1 


c-Act protoO oncogene; Rac-alpha; 
proteine kinase B 


Dystroglycan 1 


Nuclear factor related to p45 NF-E2 


Insulin- like growth factor binding protein 6 (IGFBP 

6) 


Monocyte chemoattractant protein 3 


TIMP-3 tissue inhiditor of metalloproteinases-3 


lnterleukin-2 receptor gamma chain 




CD1 8 antigen beta subunit 




CD44 antigen 




CD14 antigen 




Glutamate receptor channel subunit 
gamma 




Bone morphogenetic protein 1 




Macrophage inflammatory protein 




Macrophage inflammatory protein 1 (Act2) 




Macrophage inflammatory protein 2 alpha 




Interleukin 1Beta 




Cytokeratin 14 




Cytokeratin 1 




Cathepsin L 




Collagenase type tV 




Plasminogen activator inhibltor-2 




Serine protease inhbitor 2 (spi-2) 





EXAMPLE 2 

Gene Expression Changes in Skin and Liver of DMBA-Treated Mice 
1 . Skin treatment of mice with DMBA 

DMBA (7, 12-dimethyl benz[a] anthracene) is a potent carcinogen that causes 
tumor formation on the skins of CjBgF, mice in just over six months of chronic 
treatment. Because of its potency at causing tumors it was chosen as the first 
compound to test in animals for the change in pattern of gene expression following a 
short period, two weeks, of carcinogen treatment. 
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Approximately 5-sq. cm area was shaved on the backs of CjBjF, mice. Three 
males and three females were treated for ten weekdays with 250 lag of DMBA in 100 
|il of acetone. Controls were treated with 100 jil acetone alone. 

The next weekday following the tenth treatment, the animals were sacrificed 
5 and the following tissues removed and flashed fi'ozen in liquid nitrogen; treated skin, 
liver, bladder, lungs, and stomach. 

2. Differential gene expression in mouse skin treated with DMBA 
Since the treated tissue is the site of maximum tumor formation, this tissue 
1 0 was examined first. Total RNA was extracted irom skin by the Trizol acid phenol 
procedure after grinding the tissue to a powder in liquid nitrogen. PolyA RNA was 
isolated with oligeotex mRNA kit^w (QIAGEN). 

Radiolabeled cDNA was prepared from pooled DMBA treated skin Poly/.' 
RNA and from the pooled control PolyA* RNA, as discussed supra. Approximately 
1 5 equal amounts of each was hybridized to an Atlas™ membrane, the filter was washed, 
and exposed to X-ray film. 

After development of several exposures the films were compared and a treated 
film and control film with approximately equal hybridization for the housekeeping 
genes were compared to determine which of the 588 mouse sequences on the filter are 
2 0 overexpressed or underexpressed following treatment. Genes foimd to be 

overexpressed and suppressed in DMBA treated skin are presented in Table 5. 
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Table S Differentially Expressed Genes in DMBA-Treated Mouse Skin 
Identifled by Hybridization witli Mouse Atlas™ cDNA Array 
Membranes 



Overfixpressed Genes 




IGFBP-2; insulin-like growth factor binding 
protein 2 


RNA polymerase 1 termination factor TTF-1 


HSP84 


ERp72; disulfide isomeraserelated protein 


Cf2r. thrombin receptor 


Etoposide induced p53 responsive {E124) mRNA 


Inhibitor of the RNA-activated protein Kinase 


Grp78; glucose regulated protein 


Limphotoxin receptor 


Glutathione S-transferase 


Laminin receptor 1 


Glutathione S-transferase Mu 1 


snoN; ski-related oncogene 


Butyrate response fector 1 


Neuoleukin 


Egr-1 Zn-finger protein 


Placental Rnase inhibitor 


HMG 14 


Serine protease inhibitor 2 (spl-2) 


Nuclear factor related to P45 NF-E2 




Split hand/foot gene 




Transcription factor S-ll 




IGFBP-4; insulin-like growth factor binding 
protein 4 




Interleukin 15 



Differential gene expression in DMBA-treated mouse skin was also probed by 

hybridization with Mouse Gene Discovery Arrays (GDA) (Genome Systems, Inc.) 

containing 1 8,378 double-spotted none-redundant mouse cDNA clones chosen fix)m 

the I.M.A.G.E. collection. 2.5 pg of polyA*-RNA from DMBA-treated and control 

mouse skin was used for cDNA labeling and hybridizationwith two GDA membranes 

according to the manufacturer instructions, and obtained Phosphorlmager files were 

sent to and analyzed by Genome Systems, Inc. service. As shown in Table 6, 599 

cDNA clones display 3 or more fold differences in expression in DMBA-treated skin 

compared to acetone-treated skin among clones with top 1000 scores (calculated by 

multiplying ratio of hybridization intensity on difference in average - for two spots - 

hybridization intensity for corresponding cDNA clones; hybridization intensity for 

cDNA clones was normalized for intensity of hybridization of control DNA spots). 

Table 6. Number of cDNA Clones with Changes in Expression in DMBA- 
Treated Mouse Skin 



Difference in Expression 


Number of Genes 




Overexpressed 


Suppressed 


More than 3-fold 


285 


122 


More than 5-fold 


50 


24 


More than 9-fold 


12 


6 



10 
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3. Differential gene expression in livers of DMBA treated mice 
RNA was extracted irom mouse liver, which had been previously ground into 
a fine powder in liquid nitrogen. Poly(A)'^ RNA was prepared firom total RNA and 
radiolabeled cDNA prepared, as discussed supra. The cDNAs from the liver of 2 
5 control animal and a treated animal were each hybridized to an Atlas™ membrane and 
the differential pattern of gene expression deduced. While initially the filters showed 
little or no differential expression, upon more careful examination differential 
expression was observed indicating that the DMBA or a derivation of it had reached 
the liver and caused changes in gene expression. These results are presented in Table 
10 7. 



Table 7. Differentially Expressed Genes in Liver of DMBA-Treated Mice 
Identified by Hybridization With Mouse Atlas^** cDNA Array Membranes 



Overexpressed Genes 




p53 protein 


Cyclin A 


Lfc proto-oncogene 


Prothiymosin alpha 


CSF-1 


Cf2r; coagulation factor II (thrombin) receptor 


Cdk4 


CRE-BP1; cAMP response element binding 
protein 1 


Heat shock protein 84KD 


Gluthathione S-transferase (microsomal) 


Etoposide induced p53 responsive (E124) 
mRNA 


Elf-1 Els family transcription factor 


l-Kappa Beta 


HMG-14 


SP13; serpin 


HomeoBox protein 2.5 


CACCC Box binding protein BKLF 


Follostatin 


Homeo box protein 8 


Membrane type matrix metalloproteinase 


lnterleukin-2 receptor gamma chain 




Glucocorticoid receptor fonm A 




Bone morphogenetic protein 1 




Cathepsin D 




Catliepsin L 




Coliagenase type iV 





15 EXAMPLES 

Gene Expression Changes in Liver of Mice Treated with Furan and Trichloroethylene 
1 . Treatment of mice with Carcinogens. 

Furan and trichloroethylene (TCE) are shown to induce hepatocellular 
carcinomas in mice; both are negative in Salmonella mutagenicity test though furan 
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induces gene mutations, and sister chromatide exchanges and chromosomal 
aberrations in mammalian cells in vitro. CjB^F, Mice were treated with either furan 
(30 mg/kg/dose) or TCE (2400 and 1800 mg/kg/dose for males and females 
respectively) by gavage in com oil. The control mice were treated with com oil alone. 
5 The study contained ten animals of each sex in each treated and control group. The 
treatment was carried out for ten working days, with no treatment on weekend days 
for the carcinogen-treated animals and for nine days of control group. The treatment 
was carried out for ten working days, with no treatment on weekend days for the 
carcinogen-treated animals and for nine days for control group. Animals were 

10 sacrificed humanely and tissues were snap-frozen inl liquid nitrogen. 

2. Differential gene expression in liver of mice trated with furan and 
TCE. Total and (polyA+)RNAs isolation, labeling of cDNA, hybridization with 
Atlas"'" cDNA Expression Arrays (Clontech) and analysis of the hybridization results 
were performed as described above for differential gene expression in DMBA-treated 

15 mouse skin. List of genes that showed changes in expression in liver of fiiran and 
TCE-treated mice is presented in Table 8. 
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1 . An assay for early detection of carcinogenesis, comprising the steps of: 

a) preparing cDNA from RNA isolated from mammalian cells treated 
5 with a test agent forjess than about two years, wherein said cDNA is detectably 

labeled; 

b) determining a pattern of expression by assessing the ability of said 
labeled cDNA to hybridize to each of a multiplicity of nucleotide sequences arrayed 
on a substrate; 

10 c) determining reference patterns of differential gene expression by 

assessing the ability of a second labeled cDNA to hybridize to each of a multiplicity 
of nucleotide sequences, wherein said second labeled cDNA is prepared from RNA 
isolated from mammalian cells treated with a known carcinogen for less than about 
two years and 

15 d) comparing similarities in patterns of expression from cells treated with 

a test agent to reference patterns from cells treated with a known carcinogen. 

2. The assay of claim 1 , wherein RNA is isolated from mammalian cells 
treated with a test agent or known carcinogen for less than about six months. 

20 

3. The assay of claim 2, wherein RNA is isolated from mammalian cells 
treated with a test agent or known carcinogen for less than about two weeks. 

4. The assay of claim 3, wherein RNA is isolated from mammalian cells 
2 5 treated with a test agent or known carcinogen for less than about two days., 

5. The assay of claim 1 , wherein said mammalian cells are treated in 

vitro. 



30 



6. 



The assay of claim 1 , wherein said mammalian cells are treated in vivo. 
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7. The assay of claim 1, wherein said substrate is selected from the group 
consisting of: membrane filter; glass; ceramic; and solid organic polymer. 



8. The assay of claim 1 , wherein said detectable label is selected from the 
group consisting of: radioisotope; fluorescent dye molecule; enzyme; antigen or 
antibody. 

9. The assay of claim 1, wherein said differential gene expression is 
selected from the group consisting of: genes whose level of expression is increased; 
genes whose level of expression is decreased; gene that are expressed; and genes that 
are not expressed. 

10. A method to identiiy a test substance as a carcinogen, which method 
comprises: 

a) providing cDNA derived from mammalian cells that have been 
contacted with said test substance for less than about two years, 

b) determining a first pattern of hybridization of said cDNA with respect 
to a multiplicity of nucleotide sequences; 

c) comparing said pattem with a corresponding second pattern obtained at 
a corresponding time after said cells have been contacted with a known carcinogen, 
whereby a similarity in said first and second patterns identifies the test substance as a 
carcinogen. 

1 1 . The method of claim 1 0, wherein cDNA is obtained from said cells hat 
have been contacted with said test substance or known carcinogen for less than about 
six months. 

12. The method of claim 1 1 , wherein cDNA is obtained from said cells 
that have been contacted with said test substance or known carcinogen for less than 
about two weeks. 
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13. The method of claim 12, wherein cDNA is obtained from said cells 
that have been contacted with said test substance or known carcinogen for less than 
about two days. 

5 14. The method of claim 1 0, wherein said cells are contacted with a test 

substance or a known carcinogen in vitro. 

1 5 . The method of claim 1 0, wherein said cells are contacted with a test 
substance or a known carcinogen in vivo. 

10 

16 An assay for early detection of carcinogenesis, comprising the steps of; 

a) selecting protein isolated from mammalian cells treated with a test 
agent for less than about two years; 

b) measuring differences in protein synthesis or post-translational 
15 modification; and 

c) determining whether said test agent is carcinogenic by comparing 
differences in protein synthesis or post-translational modification to protein isolated 
from maoMnahan cells treated with a known carcinogen for less than about two years. 

20 17. The method of claim 1 6, wherein said differences in protein synthesis 

or post-translational modification are measured by electrophoretic gel analysis and 
said protein is detectably labeled. 

1 8 The method of claim 16, wherein said differences in protein synthesis 
25 or post-translational modification are measured by enzymatic digestion of protein and 
mass spectrometry analysis. 

1 9. The method of claim 16, wherein said protein is isolated from said 
cells treated with a test agent or known carcinogen for less than about six months. 

30 

20. The method of claim 19, wherein said protein is isolated from said 
cells treated with a test agent or known carcinogen for less than about two weeks. 
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2 1 . The method of claim 20, wherein said protein is isolated from said 
cells treated with a test agent or known carcinogen for less than about two days. 

5 22. The method of claim 16, wherein said cells are treated with a test 

substance or a known carcinogen in vitro. 

23. The method of claim 16, wherein said cells are treated with a test 
substance or a known carcinogen in vivo, 

10 

24. Protein measured by any one of claims 16 to 23. 

25. DNA used in any one of claims I to 15 that is predictive that a test 
agent or test substance is a carcinogen. 

15 

26. An assay kit for identifying a test substance as a carcinogen, 
comprising: 

a) reagents for the preparation of cDNA prepared from RNA isolated 
from mammalian cells treated with a test substance for less than about two years; 
20 b) reagents for the preparation of cDNA prepared from RNA isolated 

from mammalian cells treated with a known carcinogen for less than about two years; 

c) DNA sequences arrayed on a substrate that are hybridizable to said 
cDNA; and 

d) label suitable for the detection of said DNA sequences hybridized to 
25 saidcDNA; 

wherein said identification of a test substance as a carcinogen is achieved by 
creating a first pattern of differential gene expression using cDNA prepared from said 
RNA isolated from mammalian cells treated with a test substance for less than about 
two years and comparing said first pattern with a corresponding second pattern 
3 0 obtained at a corresponding time after said cells have been contacted with a known 
carcinogen, whereby a similarity in said first and second patterns identifies the test 
substance as a carcinogen. 
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P-Primers Pi: 5'.attaaccctcactaaatgctqgqqa-3* 

P2: 5'-ATTAACCCTCACTAAATCQQTCATAQ-3" 
P3: 5'.ATTAACCCTCACTAAATGCTGGTGG-3" 
P4: 5'-ATTAACCCTCACTAAATGCTGGTAQ-3* 
P5: 5'-ATTAACCCTCACTAAAQATCTGACTG-3* 
P6: 5'-ATTAACCCTCACTAAATGCTGQGTG-3' 
P7: 5'-ATTAACCCTCACTAAATGCTQTATG-3* 
P8: S'-ATTAACCCTCACTAAATGGAGCTGG-3* 
P9: S'-ATTAACCCTCACTAAATQTGGGAGQ-3' 
P10: 5'-ATTAACCCTCACTAAAQCACCQTCC-3* 



T-Primers T1: S'-CAmTGCTGAGTGATATCTTTTTTTTTAA^ 

1 rrimers ^ g^ATTATGCTGAGTGAT ATGTTrrnTTT AOS- 

T3: S'^ATTATGCTGAGTGATA TCTTnTTTTTA Q^ 
T4: g-CATTATGCTGAGTGATAT Cl I ill III l CA-3' 
T5: S'^ATTATGCTGAGTGATAT CI 1 1 1 M 11 I COg 
T6: g-CATTATGCTGAGTGATAT CI 1 1 1 U 1 1 I C G'3' 
T7- g-CATTATGCTGAGTGATATC l H I III 1 I GA-S* 
T8: 5'<;ATTATGCTGAGTGATAT CTTTTmTT GC-3' 
T9: ff-CATTATQGTQAQTGATATGI I III I H IGG-ff 
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